# Zero-shot Voice Cloning

Spark TTS 0.5B
Spark-TTS is an efficient text-to-speech system based on large language models (LLM), supporting bilingual synthesis in Chinese and English with zero-shot voice cloning.
Speech Synthesis Supports Multiple Languages
S
unsloth
116
3
Spark TTS 0.5B
Spark-TTS is an advanced text-to-speech system based on large language models, capable of high-precision and natural-sounding speech synthesis.
Speech Synthesis Supports Multiple Languages
S
prince-canuma
20
1
Aurora 1.6b
Apache-2.0
A multilingual emotional and singing voice synthesis model fine-tuned based on Dia-1.6B, supporting voice cloning and emotion control
Speech Synthesis Supports Multiple Languages
A
Lorenzob
103
3
Orpheus
Apache-2.0
A cutting-edge speech large model based on the Llama architecture, designed for high-quality, empathetic text-to-speech generation
Speech Synthesis Transformers English
O
atharva27
20
0
Openf5 TTS Base
Apache-2.0
OpenF5 TTS is an open-source text-to-speech model trained on the F5-TTS framework, supporting zero-shot voice cloning functionality, released under the Apache 2.0 license for commercial use.
Speech Synthesis English
O
mrfakename
391
43
Orpheus 3b 0.1 GGUF
Apache-2.0
A high-quality text-to-speech model based on Llama architecture, supporting emotion control and real-time streaming
Speech Synthesis Supports Multiple Languages
O
Prince-1
423
0
Orpheus Exl2 4bit
Apache-2.0
High-quality text-to-speech model based on Llama architecture, supporting emotion control and voice cloning
Speech Synthesis Transformers English
O
YaTharThShaRma999
21
3
Orpheus 3b 0.1 Ft
Apache-2.0
High-quality text-to-speech model based on Llama architecture, supporting emotion control and voice cloning
Speech Synthesis Transformers English
O
chutesai
2,686
0
Orpheus 3b 0.1 Ft
Apache-2.0
A cutting-edge voice large model based on the Llama architecture, designed for high-quality, empathetic text-to-speech generation
Speech Synthesis Transformers English
O
audo
240
1
Zonos V0.1 Transformer
Apache-2.0
Zonos-v0.1 is a leading open-weight text-to-speech model trained on over 200,000 hours of multilingual speech data, delivering expressiveness and quality comparable to or even surpassing top-tier TTS service providers.
Speech Synthesis
Z
Isi99999
30
0
Cosyvoice2 0.5B
CosyVoice is a text-to-speech (TTS) model that supports multilingual and voice conversion capabilities, providing high-quality speech synthesis.
Speech Synthesis
C
FunAudioLLM
4,573
114
GPT SoVITS V1 Base
MIT
GPT-SoVITS (V1) is a multilingual text-to-speech foundation model supporting Chinese, English, and Japanese.
Speech Synthesis Supports Multiple Languages
G
None1145
20
1
Cosyvoice 300M SFT
CosyVoice is a text-to-speech (TTS) model that supports multilingual and multi-style voice synthesis.
Speech Synthesis
C
FunAudioLLM
1,768
13
Voicecraft 330M TTSEnhanced
VoiceCraft is a PyTorch-based text-to-speech model supporting high-quality speech synthesis.
Speech Synthesis Safetensors
V
pyp1
105
1
Voicecraft 830M TTSEnhanced
VoiceCraft is a PyTorch-based text-to-speech model that supports high-quality speech synthesis.
Speech Synthesis
V
pyp1
148
8
Voicecraft Giga330m
VoiceCraft is a PyTorch-based text-to-speech model that supports high-quality speech synthesis.
Speech Synthesis
V
pyp1
1,188
0
Metavoice 1B V0.1
Apache-2.0
MetaVoice-1B is a 1.2 billion parameter text-to-speech (TTS) foundation model trained on 100,000 hours of speech data, specializing in generating emotional English speech with support for voice cloning and long-form synthesis.
Speech Synthesis English
M
metavoiceio
571
785
Kinyarwanda YourTTS
An end-to-end deep learning-based Kinyarwanda TTS system supporting zero-shot learning, requiring only 1 minute of speech to introduce a new voice.
Speech Synthesis Transformers Other
K
DigitalUmuganda
23
2
Kinyarwanda YourTTS V1
CC
This is a deep learning-based end-to-end Rwandan text-to-speech (TTS) system with zero-shot learning capability, requiring only 1 minute of speech to introduce a new voice.
Speech Synthesis Transformers Other
K
DigitalUmuganda
15
3
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase